Characterizing Optimal Sampling of Binary Contingency Tables via the Configuration Model
Authors
Abstract
A binary contingency table is an $m \times n$ array of binary entries with row sums $r = (r_1, \ldots, r_m)$ and column sums $c = (c_1, \ldots, c_n)$. The configuration model generates a contingency table by considering $r_i$ tokens of type 1 for each row $i$ and $c_j$ tokens of type 2 for each column $j$, and then taking a uniformly random pairing between type-1 and type-2 tokens. We give a necessary and sufficient condition so that the probability that the configuration model outputs a binary contingency table remains bounded away from 0 as $N = \sum_{i=1}^{m} r_i = \sum_{j=1}^{n} c_j$ goes to $\infty$. Our finding shows surprising differences from recent results for binary symmetric contingency tables.
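The token-pairing procedure described in the abstract can be illustrated with a short simulation. This is only a minimal sketch under stated assumptions, not the authors' code: the function names (`configuration_model`, `is_binary`, `estimate_binary_probability`) are hypothetical, the uniform pairing is realized by shuffling the column tokens and matching them positionally to the row tokens, and the probability of obtaining a binary table is estimated by plain Monte Carlo rather than by the paper's analysis.

```python
import random


def configuration_model(r, c, rng):
    """Sample a table with row sums r and column sums c from a uniform token pairing."""
    if sum(r) != sum(c):
        raise ValueError("row sums and column sums must share the same total N")
    # r_i type-1 tokens labelled by row i; c_j type-2 tokens labelled by column j.
    row_tokens = [i for i, ri in enumerate(r) for _ in range(ri)]
    col_tokens = [j for j, cj in enumerate(c) for _ in range(cj)]
    # Shuffling one side and pairing by position yields a uniformly random pairing.
    rng.shuffle(col_tokens)
    table = [[0] * len(c) for _ in r]
    for i, j in zip(row_tokens, col_tokens):
        table[i][j] += 1  # each pair adds one unit to cell (i, j)
    return table


def is_binary(table):
    """True when every entry is 0 or 1, i.e. no cell received two or more pairs."""
    return all(entry <= 1 for row in table for entry in row)


def estimate_binary_probability(r, c, trials=10_000, seed=0):
    """Monte Carlo estimate of the probability that the output table is binary."""
    rng = random.Random(seed)
    hits = sum(is_binary(configuration_model(r, c, rng)) for _ in range(trials))
    return hits / trials
```

For example, `estimate_binary_probability([2, 1, 1], [2, 1, 1])` returns an empirical frequency for one small margin pair; the abstract's result concerns how this probability behaves as $N$ grows, which a simulation of this kind can only suggest, not prove.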
Similar resources
The University of Chicago Faster Markov Chain Monte Carlo Algorithms for the Permanent and Binary Contingency Tables a Dissertation Submitted to the Faculty of the Division of the Physical Sciences in Candidacy for the Degree of Doctor of Philosophy
Random sampling and combinatorial counting are important building blocks in many practical applications. However, for some problems exact counting in deterministic polynomial time is provably impossible (unless P = NP), in which case the best hope is to devise suitable approximation algorithms. Markov chain Monte Carlo (MCMC) methods give efficient approximation algorithms for several importan...
Estimating the number of zero-one multi-way tables via sequential importance sampling
In 2005, Chen et al. introduced a sequential importance sampling (SIS) procedure to analyze zero-one two-way tables with given fixed marginal sums (row and column sums) via the conditional Poisson (CP) distribution. They showed that compared with Monte Carlo Markov chain (MCMC)-based approaches, their importance sampling method is more efficient in terms of running time and also provides an eas...
Plain Answers to Several Questions about Association/Independence Structure in Complete/Incomplete Contingency Tables
In this paper, we develop some results based on the relational model (Klimova et al., 2012), which permits a decomposition of the logarithm of expected cell frequencies under a log-linear type model. These results imply plain answers to several questions in the context of analyzing contingency tables. Moreover, determination of the design matrix and the hypothesis-induced matrix of the model will be discusse...
Exact Sampling and Counting for Fixed-Margin Matrices
The uniform distribution on matrices with specified row and column sums is often a natural choice of null model when testing for structure in two-way tables (binary or nonnegative integer). Due to the difficulty of sampling from this distribution, many approximate methods have been developed. We will show that by exploiting certain symmetries, exact sampling and counting is in fact possible in ...
Analysis of Dynamic Longitudinal Categorical Data in Incomplete Contingency Tables Using Capture-Recapture Sampling: A Case Study of Semi-Concentrated Doctoral Exam
Abstract. In this paper, dynamic longitudinal categorical data and estimation of their parameters in incomplete contingency tables are evaluated. To apply the proposed method, a study has been conducted on the data of the semi-concentrated doctoral exam of the National Organization for Educational Testing (NOET). The results of studies such as the obtained confidence intervals and calculating t...
Journal title:
- Random Struct. Algorithms
Volume 42, Issue
Pages -
Publication date: 2013